LONG-TERM GOALS

Our proposed research focuses on horizontal underwater acoustic communications (UAC).
Building on co-located and/or distributed multiple transducers and hydrophones, we plan to develop
multiple-input multiple-output UAC (MIMO-UAC) systems that provide range and rate enhancement.

OBJECTIVES 

Nearly all underwater communication scenarios encounter an asymmetry issue, since the
communicating nodes range from small sensors and autonomous underwater vehicles (AUVs) to large
underwater sinks and large vessels such as ships and submarines. Our research aspires to design
MIMO systems that address the needs of both sides of this asymmetry. Specifically, for the
error-sensitive command-disseminating communication links, we design robust MIMO strategies that
emphasize transmitter (command center) processing while providing superb error performance even in
turbulent seas and allowing for very simple receivers (sensor or AUV). For the data-centric
information-collecting communication links, we design high-speed MIMO schemes that place most of the
complexity and processing on the receiver (information sink) side, while allowing the sensors to
transmit data at the highest possible rate without much processing. In the latter scenario, we also
investigate cooperative communications among distributed sensors.

APPROACH 

1.  Robust  MIMO-UAC  with  Low-Complexity  Receivers 

Under  this  research  thrust,  we  study  a  simple  channel  estimation  approach  and  a  differential 
(de)modulation  scheme.  Both  designs  are  tailored  for  low-complexity  receivers.  To  ensure  the 
robustness  of  these  approaches  against  various  UAC  channel  conditions,  we  also  analyze  effects  of 
different  doubly-selective  channel  models. 

Windowed Least Squares Channel Estimator Based on Basis Expansion Model (BEM): Consider an
N-symbol transmitted block with P pilot subblocks, each of length $N_p$. To eliminate the interference
between the pilots and the data, L zeros are inserted after each data and pilot subblock. The system
diagram is shown in Fig. 1, with detailed steps as follows:

• Step 0: Add a Window at the Receiver.

Multiplying the received vector y by a diagonal window matrix W, we have:

$W y = W H x + W z = \tilde{H} x + W z$ (1)

where $\tilde{H} = W H$ is the windowed channel matrix, x is the transmitted vector and z is the noise.


• Step 1: Estimate the BEM Coefficients of the Windowed Channel.

Stacking the windowed received symbols from the pilots, we obtain

$W_P y_P = \sum_{i} D_{i,P} U_P h_i + W_P z_P = \Phi_K h_K + \Phi_{N-K} h_{N-K} + W_P z_P$

where $D_{i,P}$ has the i-th column of the inverse DFT matrix on the diagonal, $U_P$ is the pilot matrix,
subscript P means pilots, and subscripts K and N - K mark the low frequency submatrix (BEM)
and the high frequency one (model fitting bias), respectively. We obtain the least-squares estimate
of the BEM coefficients as

$\hat{h}_K = \Phi_K^{\dagger} W_P y_P$ (2)
• Step 2: De-Window and Recover the Channel Coefficients.

We  also  design  the  optimum  pilot  pattern  by  evaluating  the  mean  square  error  (MSE)  of  the  channel 
estimate.  In  addition,  we  devise  a  sliding  window  approach  and  only  retain  the  estimates  at  the  center 
of  each  window  location.  The  resultant  optimum  pilot  pattern  is  shown  in  Fig.  2. 
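
To illustrate why windowing helps a DFT-based BEM, the following minimal Python sketch fits K low-frequency complex exponentials to a single noise-free, time-varying channel tap, with and without a Blackman window, then de-windows the fit and keeps only the central half of the block (in the spirit of the sliding-window idea). It is an illustration under stated assumptions, not the estimator of [4]: the tap model, the values N = 256 and K = 7, and the helper name `dft_bem_fit` are hypothetical, and the fit is made directly on the tap rather than from pilots.

```python
import numpy as np

def dft_bem_fit(h, K, window):
    """Least-squares fit of K low-frequency DFT basis functions to the windowed tap."""
    N = len(h)
    n = np.arange(N)
    q = np.arange(K) - (K - 1) // 2                  # centred frequency indices, e.g. -3..3
    B = np.exp(2j * np.pi * np.outer(n, q) / N)      # N x K basis matrix
    coeffs, *_ = np.linalg.lstsq(B, window * h, rcond=None)
    return B @ coeffs                                # fitted *windowed* tap

N, K = 256, 7                                        # assumed block length and BEM order
n = np.arange(N)
h = np.exp(2j * np.pi * 0.37 * n / N)                # one tap with a fractional Doppler shift

centre = slice(N // 4, 3 * N // 4)                   # keep only the middle of the block
errors = {}
for name, w in {"rectangular": np.ones(N), "blackman": np.blackman(N)}.items():
    fit = dft_bem_fit(h, K, w)
    h_hat = fit[centre] / w[centre]                  # de-window (centre samples only)
    errors[name] = np.mean(np.abs(h_hat - h[centre]) ** 2)

print(errors)
```

The edges are excluded because the Blackman window is close to zero there, so dividing by it would amplify any residual error; this is one reason for retaining only the estimates near the window center.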


Differential MIMO-UAC and Effects of Doubly-Selective Channel Models: In [2], we proposed a
differential MIMO scheme using the time-invariant equivalent channel based on the DFT BEM. Following
similar steps, we obtain the time-invariant equivalent input/output (I/O) relationship within a
block based on arbitrary BEMs as:


$y(p) = G u(p) + \Omega^{-1}(p) (\Xi(p) u(p) + z(p))$ (4)

where p is the subblock index, u(p) is the M×1 transmitted subblock, $\Xi(p)$ is the model fitting bias
matrix, z(p) is the noise, G is the equivalent time-invariant channel we want and $\Omega(p)$ is:
The K×K equally-spaced decimated basis matrix $B_n$ links the K channel coefficients to the K BEM
coefficients: $g(l) = B_n^{-1} h_n(l) - B_n^{-1} \eta_n(l)$, where $h_n(l)$ is the K×1 equally-spaced decimated
channel vector and $\eta_n(l)$ is the corresponding model fitting bias vector. For the DFT BEM, $B_n$
is a scaled unitary matrix (with a scale factor that depends on N/K), while for the DPSS BEM it is
not. Accordingly, with the DPSS BEM, the noise term in (4) will be colored and the model fitting bias
changed. This is not the case for the DFT BEM, though its original model fitting bias $\Xi(p)$ is
greater than that for the DPSS BEM in the MSE sense.


Such  a  transform  between  channel  coefficients  and  BEM  coefficients  is  involved  in  all  BEM-based 
systems.  For  our  differential  system,  BEM  is  used  to  construct  the  time-invariant  equivalent  channel 
per  block;  whereas  for  coherent  schemes,  BEM  coefficients  are  estimated  to  recover  the  channel 
coefficients.  In  our  simple  WLS  channel  estimator  [4],  the  channel  estimation  MSE  is: 

$\mathrm{MSE} = E\{\|h - \hat{h}\|^2\} = E\{\|W^{-1}[\eta - B B_K^{-1}(\eta_K + \tilde{z}_n)]\|^2\}$ (6)

where $\eta$ is the model fitting bias after windowing. Since the DPSS BEM approximates the true channel
more precisely than the DFT BEM, the norms of the model fitting biases $\eta$ and $\eta_K$ of the DPSS BEM
in (6) are smaller than those of the DFT BEM [1]. However, for the DPSS BEM, the noise will be colored
since $B_K$ is not unitary. This is clearly not the case for the DFT BEM. Therefore, there is again a
tradeoff between the modeling accuracy and the noise coloring effect, as in the differential case.
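
The noise coloring effect can be checked numerically. The sketch below builds a K×K equally-spaced decimated basis matrix for a DFT BEM and for a DPSS BEM, and measures how far the covariance of the transformed white noise is from a scaled identity. Everything specific here is an assumption made for illustration: N = 64, K = 4, sampling at subblock centers, a DPSS half-bandwidth of K/(2N), the DPSS being obtained as the leading eigenvectors of the sinc kernel, and the function names themselves.

```python
import numpy as np

def dft_basis(N, K):
    """N x K matrix of the K lowest-frequency complex exponentials."""
    n = np.arange(N)
    q = np.arange(K) - (K - 1) // 2
    return np.exp(2j * np.pi * np.outer(n, q) / N)

def dpss_basis(N, K, half_bandwidth):
    """N x K matrix of the K most band-concentrated DPSS (sinc-kernel eigenvectors)."""
    d = np.subtract.outer(np.arange(N), np.arange(N))
    kernel = 2 * half_bandwidth * np.sinc(2 * half_bandwidth * d)
    _, eigvecs = np.linalg.eigh(kernel)              # eigenvalues come out in ascending order
    return eigvecs[:, ::-1][:, :K] * np.sqrt(N)

def noise_colouring(B_dec):
    """Relative distance of cov(B^{-1} z) from a scaled identity, for white z."""
    B_inv = np.linalg.inv(B_dec)
    cov = B_inv @ B_inv.conj().T
    white = np.trace(cov).real / len(cov) * np.eye(len(cov))
    return np.linalg.norm(cov - white) / np.linalg.norm(cov)

N, K = 64, 4                                         # assumed block length and BEM order
rows = np.arange(K) * (N // K) + N // (2 * K)        # equally spaced sampling instants
colouring_dft = noise_colouring(dft_basis(N, K)[rows])
colouring_dpss = noise_colouring(dpss_basis(N, K, K / (2 * N))[rows])

print("DFT :", colouring_dft)
print("DPSS:", colouring_dpss)
```

A value near zero means the noise stays white after the transform, which is what a scaled unitary matrix guarantees; a clearly nonzero value indicates colored noise.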

2.  Enhanced  Channel  Estimation  and  Symbol  Detection  for  High  Speed  MIMO-UAC 

We address two key issues regarding the design of a high-speed MIMO-UAC system, namely channel
estimation and symbol detection. To enhance channel estimation performance, we propose a cyclic
approach (CA) for designing training sequences and an iterative adaptive approach (IAA) for
estimating the channel taps. Regarding symbol detection, we present a minimum MSE (MMSE) based
detection scheme, called RELAX-BLAST, which combines the vertical Bell Labs Layered Space-Time
(V-BLAST) algorithm and the cyclic principle of RELAX.


Training Sequence Design: Consider phase-coherent communications over N×M MIMO-UAC
multipath channels. For multipath channels, training sequences with good auto- and cross-correlation
properties are required. Training symbols have the generic form $x_n(t) = e^{j\phi_n(t)}$, where
$\phi_n(t) \in [0, 2\pi)$ is the phase of the training symbol $x_n(t)$ sent by the n-th transmitter,
n = 1, 2, ..., N and t = 1, 2, ..., P, with P being the total training length. Let
$X \in \mathbb{C}^{(P+R-1) \times NR}$ contain the N training sequences and their delayed replicas, and let
$\varepsilon = \|X^* X - P I\|^2$ measure their correlation quality,
where $\|\cdot\|$ denotes the Frobenius matrix norm, and I is an identity matrix. Let U be an arbitrary
semi-unitary matrix. Then minimizing $\varepsilon$ can be formulated in the following related way

$\{\phi_n(t)\} = \arg\min_{\{\phi_n(t)\},\, U} \|X - \sqrt{P}\, U^*\|^2$, subject to $U U^* = I$ (7)

This optimization problem can be solved by a cyclic approach (CA) as follows:

• Step 0: U is initialized to be $[\tfrac{1}{\sqrt{2}}(I + jI) \;\; 0]$.

• Step 1: With the most recent $U^*$, the solution to (7) is $\phi_n(t) = \arg\left(\sum_{r=1}^{R} z_r\right)$, where
$\{z_r\}$ are given numbers.

• Step 2: With the most recent $\phi_n(t)$, the solution to (7) is given by $U^* = \bar{U} \tilde{U}^*$, where
$\sqrt{P} X = \bar{U} \Sigma \tilde{U}^*$ is the singular value decomposition (SVD) of $\sqrt{P} X$.

Iterate Steps 1 and 2 until the difference of the cost
function in (7) between two successive iterations becomes lower than a predefined threshold.
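
The alternation between the phase update and the SVD-based update can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions rather than the implementation used in this work: it starts from random phases instead of the Step 0 initialization of U, the sizes (2 sequences, P = 32, R = 4) are arbitrary, and the names `build_X` and `cyclic_approach` are hypothetical.

```python
import numpy as np

def build_X(x, R):
    """Stack each sequence and its R-1 delayed replicas as columns of X."""
    N, P = x.shape
    X = np.zeros((P + R - 1, N * R), dtype=complex)
    for n in range(N):
        for r in range(R):
            X[r:r + P, n * R + r] = x[n]
    return X

def cyclic_approach(N, P, R, iters=200, tol=1e-9, seed=0):
    rng = np.random.default_rng(seed)
    x = np.exp(2j * np.pi * rng.random((N, P)))      # assumption: random initial phases
    costs = []
    for _ in range(iters):
        X = build_X(x, R)
        # Step 2: nearest semi-unitary matrix to X (V plays the role of U^*)
        U_bar, _, U_tilde_h = np.linalg.svd(X, full_matrices=False)
        V = U_bar @ U_tilde_h
        costs.append(np.linalg.norm(X - np.sqrt(P) * V) ** 2)
        # Step 1: each phase is the argument of the sum of the R matching entries
        for n in range(N):
            z = np.zeros(P, dtype=complex)
            for r in range(R):
                z += V[r:r + P, n * R + r]
            x[n] = np.exp(1j * np.angle(z))
        if len(costs) > 1 and costs[-2] - costs[-1] < tol:
            break
    return x, costs

x, costs = cyclic_approach(N=2, P=32, R=4)
print("cost at first / last iteration:", costs[0], costs[-1])
```

Because each step minimizes the cost in (7) over one block of variables with the other held fixed, the recorded cost cannot increase from one iteration to the next, and the sequences remain unimodular throughout.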


The Channel Estimation Algorithm (IAA): The second phase of channel estimation involves the design
of the algorithm that estimates the channel impulse response (CIR), denoted as h, using the training
sequences (or the previously detected symbols) S and the highly contaminated measurements y. The
channel estimation problem at each receiver has the generic form y = Sh + e, where e represents the
additive noise. We consider IAA for estimating h. IAA makes no assumptions on the statistical
properties of e. Let P be a diagonal matrix whose diagonal elements are $P_r = |h_r|^2$, where $h_r$ represents
the r-th element of h. If the r-th column of S is written as $s_r$, then IAA can be implemented as:

• Step 0: Initialize $P_r = |s_r^* y|^2 / (s_r^* s_r)^2$.

• Step 1: With the most recent P, calculate $R = S P S^*$.

• Step 2: With the most recent R, update $h_r = s_r^* R^{-1} y / (s_r^* R^{-1} s_r)$ and $P_r = |h_r|^2$.

Iterate Steps 1 and 2 until convergence.
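
The three steps translate almost directly into code. The sketch below is a minimal illustration, assuming the standard IAA weighted least-squares update for Step 2; the random unimodular matrix standing in for the training matrix S, the 3-tap sparse channel, the problem sizes, the fixed iteration count, and the tiny diagonal loading added for numerical safety are all assumptions made here.

```python
import numpy as np

def iaa(S, y, iters=15, loading=1e-9):
    """Iterative adaptive approach for y = S h + e (illustrative sketch)."""
    col_energy = np.sum(np.abs(S) ** 2, axis=0)          # s_r^* s_r for every column
    h = (S.conj().T @ y) / col_energy                    # Step 0: matched-filter start
    for _ in range(iters):
        P = np.abs(h) ** 2
        R = (S * P) @ S.conj().T                         # Step 1: R = S P S^*
        # Assumption: tiny diagonal loading so that R stays invertible.
        R += loading * np.trace(R).real / len(R) * np.eye(len(R))
        Rinv_y = np.linalg.solve(R, y)
        Rinv_S = np.linalg.solve(R, S)
        num = S.conj().T @ Rinv_y                        # s_r^* R^{-1} y
        den = np.sum(S.conj() * Rinv_S, axis=0).real     # s_r^* R^{-1} s_r
        h = num / den                                    # Step 2
    return h

rng = np.random.default_rng(0)
n_obs, n_taps = 40, 60                                   # assumed sizes
S = np.exp(2j * np.pi * rng.random((n_obs, n_taps)))     # stand-in for the training matrix
h_true = np.zeros(n_taps, dtype=complex)
h_true[[3, 17, 42]] = [1.0, -0.8j, 0.5]                  # sparse channel (assumed)
noise = 0.01 * (rng.standard_normal(n_obs) + 1j * rng.standard_normal(n_obs))
y = S @ h_true + noise

h_mf = (S.conj().T @ y) / n_obs                          # plain matched filter, for reference
h_iaa = iaa(S, y)
err_mf = np.linalg.norm(h_mf - h_true)
err_iaa = np.linalg.norm(h_iaa - h_true)
print("matched filter error:", err_mf, " IAA error:", err_iaa)
```

The matched-filter estimate is shown only as a reference point; it is the same quantity that initializes the iteration in Step 0.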


Sparse channel estimates can be obtained by combining IAA with the Bayesian information criterion
(BIC). Moreover, the RELAX algorithm can be used to further improve the IAA with BIC estimates.


Symbol Detection: Following the CIR estimation is the design of the detection scheme for estimating
the payload symbols. We use an MMSE-based filter for symbol detection, and propose the
RELAX-BLAST algorithm. RELAX-BLAST first detects the symbol with the dominant channel taps and
subtracts it from the measurements. Then, it estimates the next dominant symbol from the residual
signal. The two already detected symbols are then updated in an iterative manner until the difference of
the RELAX-BLAST estimates between two successive iterations becomes less than a certain threshold.
Once these two symbols are subtracted from the measurements and the third strongest symbol is
estimated, the three symbols are again updated in an iterative manner until the three estimates no
longer improve. This process is repeated until all N symbols are detected and updated.
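
The detect, subtract, and cyclically refine loop can be sketched as follows. This is a deliberately simplified illustration and not the scheme evaluated in this report: it assumes a flat (single-tap) channel, one QPSK symbol per transmitter, perfect channel knowledge, and a known noise variance, and the function names are hypothetical.

```python
import numpy as np

QPSK = np.array([1 + 1j, 1 - 1j, -1 + 1j, -1 - 1j]) / np.sqrt(2)

def mmse_slice(residual, H, k, interferers, noise_var):
    """MMSE estimate of symbol k from a residual still containing `interferers`, then slice."""
    cols = [k] + list(interferers)
    C = H[:, cols] @ H[:, cols].conj().T + noise_var * np.eye(H.shape[0])
    soft = np.vdot(np.linalg.solve(C, H[:, k]), residual)    # h_k^* C^{-1} residual
    return QPSK[np.argmin(np.abs(QPSK - soft))]

def relax_blast(y, H, noise_var, max_sweeps=20):
    n_tx = H.shape[1]
    order = [int(k) for k in np.argsort(-np.linalg.norm(H, axis=0))]  # strongest first
    x_hat = np.zeros(n_tx, dtype=complex)
    detected = []
    for k in order:
        detected.append(k)
        undetected = [i for i in order if i not in detected]
        for _ in range(max_sweeps):                      # RELAX-style cyclic refinement
            previous = x_hat.copy()
            for j in reversed(detected):                 # newest symbol is estimated first
                others = np.array([i for i in detected if i != j], dtype=int)
                residual = y - H[:, others] @ x_hat[others]
                x_hat[j] = mmse_slice(residual, H, j, undetected, noise_var)
            if np.array_equal(x_hat, previous):          # no estimate changed: stop
                break
    return x_hat

rng = np.random.default_rng(1)
n_rx, n_tx, noise_var = 12, 4, 0.01                      # assumed sizes and noise level
H = (rng.standard_normal((n_rx, n_tx)) + 1j * rng.standard_normal((n_rx, n_tx))) / np.sqrt(2)
x_true = QPSK[rng.integers(0, 4, n_tx)]
noise = np.sqrt(noise_var / 2) * (rng.standard_normal(n_rx) + 1j * rng.standard_normal(n_rx))
y = H @ x_true + noise

x_hat = relax_blast(y, H, noise_var)
print("all symbols recovered:", np.array_equal(x_hat, x_true))
```

Stopping after the first pass for each newly added symbol, without the refinement sweeps, would give the plain successive (CLEAN-style) detector; the extra sweeps are what the RELAX principle contributes.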

3.  Distributed  MIMO  Communications  with  Differential  (De)Modulation 


Relay networks allow a source to communicate with a destination via a number of relay nodes. By
forming virtual antenna arrays in a cooperative manner, spatial diversity gain can be achieved without
imposing antenna packing limitations. Most previous work on relay networks focused on coherent
modulation, assuming availability of the channel state information (CSI). To reduce hardware
complexity and communication overhead, we consider differential modulation in this research. We
quantify the average error performance in decode-and-forward (DF) and amplify-and-forward (AF)
networks as follows:
where $\bar{\gamma}_{ij}$ is the average signal-to-noise ratio between nodes i and j. We not only give these formulas
in closed form, but also prove their convexity in both energy allocation and relay locations. Based on
these, resource optimization can be performed to further enhance the system performance.


WORK  COMPLETED 

In terms of robust MIMO-UAC with low-complexity receivers, we have established a simple
windowed LS channel estimator, developed a differential MIMO scheme obviating channel estimation,
and analyzed the effects of different doubly-selective channel models in UAC scenarios. For
high-speed MIMO-UAC, we have focused on the channel estimation and symbol detection problems.
We have presented the CA approach for designing training sequences with good auto-
and cross-correlation properties. IAA coupled with BIC and RELAX has been presented as an
approach for estimating the CIR. We also proposed a new detection method called RELAX-BLAST.
Our proposed schemes are tested via both simulations and field data from the acoustic communications
experiment (RACE’08) conducted by the Woods Hole Oceanographic Institution (WHOI).

RESULTS 

1. Robust MIMO-UAC with Low-Complexity Receivers

WLS Channel Estimator: We first simulate a typical underwater acoustic environment with a carrier
frequency of 12 kHz and a transceiver velocity of 2.4 knots. The simulation parameters are given in
Table I. Figs. 3 and 4 show the MSE and BER comparisons of our WLS estimator with other alternatives.
The windowed approach (WLS-Blackman) outperforms the unwindowed one (WLS-Rectangular). Unlike
our WLS estimator, the MMSE estimator in [3] assumes complete knowledge of the channel statistics.
Nevertheless, our simple WLS estimator significantly outperforms [3]. To test the robustness of our
approach, we selected 3 data frames from the RACE’08 data, during the roughest periods when the
channels vary intensively. Our WLS estimator achieves a 0.24% uncoded BER when the regular LS
estimator fails (~50% BER), both providing a data rate of 10.3 kbps at a 1000 m Tx-Rx distance.

Differential MIMO-UAC: The simulated DFT BEM gives consistently lower BER than the DPSS BEM, as
shown in Fig. 5. In RACE’08, we compared our schemes with a plain OFDM system. The uncoded BER for
all data frames at a 1000 m transmitter-to-receiver distance is plotted in Fig. 6. In turbulent sea
conditions, the plain OFDM gives an uncoded BER exceeding 10%, while our BEM-based differential
schemes consistently provide around 0.1%. The BER averaged over all data frames is 0.12% and 0.13%
for our schemes. It is worth noting that this performance is achieved with very simple differential
decoding at the receiver, obviating any channel estimation requirement.

2.  Enhanced  Channel  Estimation  and  Symbol  Detection  For  High  Speed  MIMO-UAC 

Consider a simulated 4×1 multi-input single-output (MISO) system with time-invariant channels.
Fig. 7 shows the modulus of the simulated CIRs corresponding to the four transmitters with R = 30 delay
taps. Fig. 8 shows the MSE of the channel estimates obtained by MP (matching pursuit), OMP
(orthogonal matching pursuit), LSMP (least squares matching pursuit) and IAA with QPSK training
and CA training. We can see that with the QPSK training, IAA significantly outperforms MP, OMP
and LSMP, whereas with the CA training, the performance gap between IAA and the other algorithms
diminishes. Next, we evaluate the BER of CLEAN-BLAST and RELAX-BLAST for a 4×12 MIMO
system. Herein we name V-BLAST as CLEAN-BLAST to emphasize its analogy to the CLEAN
algorithm used in spectral estimation. Each transmitted package contains a CA training sequence with
P = 512 and a payload sequence consisting of 6000 QPSK modulated symbols. IAA is used for channel
estimation. The average BERs over 100 Monte-Carlo trials are shown in Fig. 9. We observe that
RELAX-BLAST shows much better performance than CLEAN-BLAST in the absence of severe error
propagation. Experimental results in RACE’08 employing a 4×24 MIMO system show that the
proposed scheme achieves an average uncoded BER of 0.38% at a payload bit rate of 31.25 kbps and an
average coded BER of 0% at a payload bit rate of 15.63 kbps.

3.  Distributed  MIMO  Communications  with  Differential  (De)Modulation 

In Fig. 10, we plot the error performance contour for a fixed total energy budget of 15 dB and the
energy consumption contour for a target error rate of 0.1%, for the differentially modulated AF relay
protocol. We observe that, counter-intuitively, relays off the direct line linking the source and
destination can outperform those sitting on this direct line.

IMPACT/APPLICATIONS 

The inherent bandwidth limitations of the coherent underwater acoustic channel call for a technical
breakthrough. MIMO signal processing is a promising bandwidth-efficient route to high-data-rate
and high-quality services. Building on both coherent and differential approaches, and considering both
co-located and distributed transceiver units, our results are expected to favorably impact
high-rate, long-range MIMO-UAC designs.


Table I: Simulation parameters for coherent detection

Parameter        Value
Max. Doppler     10 Hz
Sym. duration    1/6000 s
Max. delay       10 ms
L                60
N                1800
P                9
Pilot energy     61
Profile          s(l) = e^{-0.1 l}
Modulation       QPSK
Equalizer        MMSE

Figure 1: System diagram for the coherent detection with our windowed least squares channel
estimator.

[graph: The windowing and de-windowing operations are implemented only at the receiver and only
on pilots, affecting neither the transmitted data pattern nor the demodulator.]


Figure 2: The optimum pilot pattern

[graph: Each pilot subblock consists of one pilot symbol padded by L zeros before and after.]


Figure 3: Channel estimation MSE vs. SNR for different channel estimators. Single transducer and
single hydrophone are used.

[graph: Our WLS estimator with a Blackman window has the best performance.]


Figure 4: BER vs. SNR for coherent detection with different channel estimators. Single transducer
and single hydrophone are used.

[graph: Our WLS estimator with a Blackman window has the best performance.]
Figure 5: Average BER vs. SNR for differential OSTBC based on DFT BEM and DPSS BEM. Two
transducers and one hydrophone are used.

[graph: DFT BEM outperforms DPSS BEM consistently.]


Figure 6: Uncoded BER for differential OSTBC using 2 transducers and 12 hydrophones in
RACE’08 at 1000 m distance. Zero error is illustrated as $10^{-5}$.

[graph: 1. The plain OFDM scheme has much worse performance than the BEM-based ones. 2.
DFT BEM is slightly better than DPSS BEM.]
Figure 7: The modulus of the simulated CIRs between the four transmitters and the receiver in a
4×1 MISO system.
Figure 8: MSE of the CIR estimates for a 4×1 MISO system using the QPSK and CA training
sequences with P = 128 symbols. Each point is averaged over 100 Monte-Carlo trials.
Figure 9: The BERs of each of the four transmitted payload sequences for a 4×12 MIMO system.
The training sequences consist of P = 512 symbols and are designed by the CA algorithm. The
detection performance of CLEAN-BLAST and RELAX-BLAST is compared in terms of BER
averaged over 100 independent Monte-Carlo trials for varying levels of the noise variance.
Figure 10: (a) Error Performance Contour and (b) Energy Consumption Contour of a 2-relay
cooperative system. Differentially modulated AF protocol is used with a total energy constraint of
15 dB in (a) and a target error rate of 0.1% in (b).

[graph: 1. Some relays off the direct source-destination line can perform better. 2. The relays from
the inner loop should be chosen. 3. The relays closer to the destination receive more benefits.]